Using interaction data for improving the offline and online evaluation of search engines
Author
Abstract
This thesis investigates how web search evaluation can be improved using historical interaction data. Modern search engines combine offline and online evaluation approaches in a sequence of steps that a tested change must pass through to be accepted as an improvement and subsequently deployed. We refer to such a sequence of steps as an evaluation pipeline. We consider the evaluation pipeline to contain three sequential steps: an offline evaluation step, an online evaluation scheduling step, and an online evaluation step. We show that historical user interaction data can help improve the accuracy or efficiency of each step of the web search evaluation pipeline. As a result of these improvements, the overall efficiency of the entire evaluation pipeline is increased.

Firstly, we investigate how user interaction data can be used to build accurate offline evaluation methods for query auto-completion mechanisms. We propose a family of offline evaluation metrics for query auto-completion that represents the effort the user has to spend in order to submit their query. The parameters of our proposed metrics are trained against a set of user interactions recorded in the search engine’s query logs. In our experimental study, we observe that our proposed metrics are significantly more correlated with an online user satisfaction indicator than the metrics proposed in the existing literature. Hence, fewer changes will pass the offline evaluation step only to be rejected after the online evaluation step. This, in turn, would increase the efficiency of the entire evaluation pipeline.

Secondly, we state the problem of the optimised scheduling of online experiments. We tackle this problem by considering a greedy scheduler that prioritises the evaluation queue according to the predicted likelihood of success of each experiment. This predictor is trained on a set of online experiments and uses a diverse set of features to represent an online experiment. Our study demonstrates that a higher number of successful experiments per unit of time can be achieved by deploying such a scheduler at the second step of the evaluation pipeline. Consequently, we argue that the efficiency of the evaluation pipeline can be increased.
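The greedy scheduling idea described above can be illustrated with a minimal sketch. It is not the thesis's implementation: the `Experiment` class, the `p_success` and `duration_days` fields, and the assumption that experiments run one at a time under a fixed time budget are all hypothetical, and the success probabilities are made-up numbers standing in for the output of a trained predictor.

```python
from dataclasses import dataclass


@dataclass
class Experiment:
    name: str
    p_success: float    # hypothetical predictor output: likelihood of success
    duration_days: int  # hypothetical: time the experiment occupies the queue


def greedy_schedule(queue):
    """Order experiments by predicted likelihood of success, highest first."""
    return sorted(queue, key=lambda e: e.p_success, reverse=True)


def expected_successes(order, budget_days):
    """Expected number of successes among experiments that fit in the budget.

    Assumes experiments run sequentially in the given order and stops at the
    first experiment that no longer fits.
    """
    elapsed = 0
    total = 0.0
    for exp in order:
        if elapsed + exp.duration_days > budget_days:
            break
        elapsed += exp.duration_days
        total += exp.p_success
    return total


# Illustrative queue in arrival order (values are invented for the example).
queue = [
    Experiment("A", 0.10, 7),
    Experiment("B", 0.60, 7),
    Experiment("C", 0.30, 7),
]

fifo = expected_successes(queue, budget_days=14)
greedy = expected_successes(greedy_schedule(queue), budget_days=14)
print(f"arrival order: {fifo:.2f}, greedy order: {greedy:.2f}")
```

Under these assumptions, running the queue in greedy order yields a higher expected number of successful experiments within the same time budget than running it in arrival order, which is the effect the abstract attributes to the scheduler.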
Similar resources
A Technique for Improving Web Mining using Enhanced Genetic Algorithm
The World Wide Web is growing at a very fast pace and makes a lot of information available to the public. Search engines use conventional methods to retrieve information on the Web; however, their search results can still be refined and their accuracy is not high enough. One of the methods for web mining is evolutionary algorithms which search according to the user interests...
Explaining the perception and experience of faculty members of Mashhad University of Medical Sciences of virtual education during the COVID-19 epidemic
Background & Aim: Considering that recognizing the experiences of professors and students is important in improving the quality of virtual education, the present study aims to explain the understanding and experience of faculty members of Mashhad University of Medical Sciences of virtual education during the COVID-19 epidemic. Methods: This qualitative study was conducted with a content analys...
Review of ranked-based and unranked-based metrics for determining the effectiveness of search engines
Purpose: Traditionally, there have been many metrics for evaluating search engines; nevertheless, various researchers have proposed new metrics in recent years. Awareness of these new metrics is essential for conducting research in the field of search engine evaluation. So, the purpose of this study was to provide an analysis of important and new metrics for evaluating search engines. Methodology: This is ...
A study of the level of information literacy among librarians of public libraries in Khuzestan province, aimed at identifying their potential strengths or weaknesses in this area
The present research studies the level of information literacy among librarians of public libraries in Khuzestan province with the purpose of identifying their potential strengths and weaknesses. The statistical population in this study includes all librarians of public libraries in Khuzestan province. The findings indicate that these librarians are at a desirable level in the following skill...
The effect of online and offline workplace communication networks on employees' job performance
Communication has always been one of the most important factors of organizational success. Employees’ ties in online and offline workplace communication networks are complementary resources whose interaction can influence their job performance. Network research in organizations shows that network characteristics have a significant effect on employee and organizational performance. ...
Publication date: 2016